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@ A method of producing a polypeptide product end a plssmidlc expression vehicle therefor, a method of creating 
expression plasmid, a method of cleaving double stranded DMA, and specific plasmids. 

@ Novel plasmidic expression vehicles and methods of 
using them in the production of useful polypeptides by 
recombinant bacteria are described. The plasmids employ a 
tryptophan promoter-opGrator system from which the 
attenuator region ordinarily present has been deleted. Bac- 
teria containing, the plssmids can accordingly be repressed 
by the addition of tr\'plophan against expression of desired 
polypeptides coded for by inserted genes white Ihey are 
grown to (evols suitable for industrial-scale production. 

CJ Additive tryptophan may then be withdrawn, essentially 

^ derepressing the pathway and permitting efficient produc- 
tion of the desired product in high yield. 
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A METHOD OF PRCXXTING A POLYPEPTIIE PROXJCT 
AND A PIASMIDIC EXPRESSigj VEHICLE THEREFOR, 
A METHOD OF CREATING AN EXPRESSION PIASMID, 
A METHOD OF CLEAVING DOUBLE STRAt^DED DMA, 
AND SPECIFIC PIASMIDS. 



BACKGROUND OF THE INVENTION 

• With the advent of recombinant DNA technology, the controlled 
bacterial production of an enormous variety of useful polypeptides has. 
become possible. Already in hand are bacteria modified by this 
technology to permit the production of such polypeptide products such as 
somatostatin (K. Itakura, et _al_. , Science J^, 1056 [1977]), the 
(component) A and B chains of human insulin (D.V. Goeddel, et aj,. , Proc 
Nat'l Acad Sci, USA 25, 105 [1979]), and human growth hormone (O.V. 
Goeddel, et , Nature 281. 544 [1979]). More recently, recombinant 
OiNA techniques have been used to occasion the bacterial production of 
thymosin alpha 1, an immune potentiating substance produced by the 
thymus. Such is the power of the technology that virtually 
any useful polypeptide can be bacterially produced, putting 
within reach the controlled manufacture of hormones, 
enzymes, antibodies, and vaccines against a wide variety 
of diseases. The cited materials, which describe 
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in greater detail the representative examples referred to above, are 
incorporated herein by reference, as are other publications referred to 
Infra , to illuminate the background of the invention. 

The work horse of recombinant DNA technology is the plasmid, a 

5 -non-chromosomal loop of double-stranded DNA found in bacteria> 
oftentimes in multiple copies per bacterial cell. Included in the 
infiarmation encoded in the.plasmid DNA is that required to reproduce the 
plasraid in daughter cells (i.e., a 'Veplicon") and ordinarily, one or 
nrare selection characteristics, such as resistance to antibiotics, which 

10 perrait clones of the host cell containing the plasmid of interest to be 
recognized and preferentially grown in selective media. The utility of 
bacterial plasmids lies in the fact that they can be specifically 
cleaved by one or another restriction endonuclease or "restriction 
enzyme", each of which recognizes a different site on the plasmidic 

15 DMA. Thereafter heterologous genes or gene fragments may be inserted 
into the plasmid by endwise joining at the cleavage site or at 
reconstructed ends adjacent 'the cleavage site. As used herein, the term- 
"heterologous" refers to a gene not ordinarily found in, or a 
polypeptide sequence ordinarily not produced by, E^. col i , whereas the 

20 term "homologous" refers to a gene or polypeptide which is produced in 
wild-type E. coli . ONA recombination is performed outside the bacteria, 
but the resulting "recombinant" plasmid can be introduced into bacteria 
by a process known as transformation and large quantities of the 
heterologous gene-containing recombinant plasmid obtained by growing the 

25 transformant. Moreover, where the gene is properly inserted with 

reference to portions of the plasmid which govern the transcription and 
translation of the encoded DNA message, the resulting expression vehicle 
can be used to actually produce the polypeptide sequence for which the 
inserted gene codes, a process referred to as expression. 

30 Expression is initiated in a region known as the promoter which is 

reccgni/^ed by and bound by RNA polymerase. In some cases, as in the trp 
operon discussed infra , promoter regions are overlapped by "operator" 
regions to form a combined promoter-operator. Operators are DNA 
sequences which ire recognized by so-called repressor proteins which 

35 serve to regulate the frequency of transcription initiation at a 
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particular promoter. The polymerase travels along the DNA, transcribing 
the inforrtation contained in the coding strand from its 5* to 3* end 
into messenger RNA which is in turn translated into a polypeptide having 
the amino acid sequence for which the ONA codes. Each amino acid is 

5 ' encoded by a unique nucleotide triplet or "codon" within what ^^y for 
present purposes be referred to as the "structural gene", i.e. that part 
*?hich encodes the amino acid sequence of the expressed product. After 
binding to the promoter, the RNA polymerase first transcribes 
nucleotides encoding a ribosome binding site, then a translation 

10 initiation or "start" signal (ordinarily ATG, which in the resulting 
messenger RNA becomes AUG), then the nucleotide codons within the 
structural gene itself- So-called stop codons are transcribed at the 
end of the structural gene whereafter the polymerase may form an 
additional sequence of messenger RNA which, because of the presence of 

15 the stop signal, will remain untranslated by the ribosomes. Ribosomes 
bind to the binding site provided on the messenger RNA, in bacteria 
ordinarily as the mRNA is being formed, and themselves produce the 
encoded polypeptide, beginning at the translation start signal and 
ending at the previously mentioned stop signal. The desired product is 

20 produced if the sequences encoding the ribosome binding site are 

positioned properly with respect to the AUG initiator codon and if all 
remaining codons follow the initiator codon in phase. The resulting 
product may be obtained by lysing the host cell and recovering the 
product by appropriate purification from other bacterial protein. 

25 Polypeptides expressed through the use of recombinant DNA 

technology may be entirely heterologous, as in the case of the direct 
expression of human growth hormone, or alternatively may comprise a 
heterologous polypeptide and, fused thereto, at least a portion of the 
amino acid sequence of a homologous peptide, as in the case of the 

30 production of intermediates for somatostatin and the components of human 
insulin, in the latter cases, for example, the fused homologous 
polypeptide comprised a portion of the amino acid sequence for beta 
-..galactos idase. In those cases, the intended bioactive product is 
bioinacti vated by the fused, homologous polypeptide until the latter is 

35 cleaved away in an extracellular environment. Fusion proteins like 
those just mentioned can be designed so es to permit highly scecif'ic 
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cleavage of the precusor protein from the intended product, as by the 
action of cyanogen bromide on methionine, or alternatively by enzymatic 
cleavage. See, eg., G.B. Patent Publication No. 2 007 675 A. 

If recombinant DMA technology is to fully sustain its promise, 
5 systems must be devised which optimize expression of gene inserts, so 
that the intended -polypeptide products can be made available in high 
yield. The beta lactamase and lactose promoter-operator systems most 
commonly used in the past, while useful, have not fully utilized the 
capacity of the technology from the standpoint of yield, A need has 
10 -existed for a, bacterial expression vehicle capable of the controlled 
expression of desired polypeptide products in higher yield. 

Tryptophan is an amino acid produced by bacteria for use as a 
component part of homologous polypeptides in a biosynthetic pathway 
which proceeds: chorismic acid anthranil ic acid-^'phosphoribosyl 

15 anthranili: acid — ^CDRP [enol-l-(o-carboxyphenylamino)-l-desoxy-D- 
ribulose-5-phosphate]-^ indol-3-g1ycerol-phosphate, and ultimately to 
tryptophan itself. The enzymatic reactions of this pathway are 
catalyzed by the products of the tryptophan or "trp" operon, a 
polycistronic ONA segment which is transcribed under the direction of 

20 the trp promoter-operator system. The resulting polycistronic messenger 
RNA encodes the so-called trp leader sequence and then, in order, the 
polypeptides referred to as trp E, trp 0, trp C, trp B and trp A. These 
polypeptides variously catalyze and control individual steps in the 
pathway chorismic acid tryptophan. 

25 In wild-type E^. col i , the tryptophan operon is under at least three 

distinct forms of control. In the case of promoter-operator repression, 
tryptophan acts as a corepressor and binds to its aporepressor to form 
an active repressor complex which, in turn, binds to the operator, 
closing down the pathway in its entirety. Secondly, by a process of 

30 feedback inhibition, tryptophan binds to a complex of the trp E and trp 
0 polypeptides, prohibiting their participation in the pathway 
synthesis. Finally, control is effected by a process known as 
attenuation under the control of the "attenuator region" of the gene, a 
region within the trp leader sequence. See generally G.F. Miozzari 
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et al, J. Bacteriology 133. 1457 (1978); The Operon 263-302, Cold Spring 
Harbor Laboratory (1978), Miller and Reznikoff, gds.; F. Lee et a^, 
Proc. Natl. Acad. Sci. USA 74. 4365 (1977) and K. Bertrand et ai, J. 
Mol. Biol. jL03, 319 (1975). The extent of attenuation appears to be 
governed by the intracellular concentration of tryptophan, and in 
wild-type E. col i the attenuator terminates expression in approximately 
nine out of ten cases, possibly through the formation of a secondary 
structure, or "termination loop", in the messenger RNA which causes the 
RNA polymerase to prematurely disengage from the associated DNA. 



Other workers have employed the trp operon to obtain some measure 
of heterologous polypeptide expression. This work, it is believed, 
attempted to deal with problems of repression and attenuation by the 
addition of -indole acrylic acid, an inducer and analog which competes 
with tryptophan for trp repressor molecules, tending toward derepression 
15 by competitive inhibition. At the same time the inducer diminishes 
attenuation by inhibiting the enzymatic conversion of indole to 
tryptophan and thus effectively depriving the cell of tryptophan. As a 
result more polymerases successfully read through the attentuator. 
However, this approach appears problematic from the standpoint of 
20 completing translation consistently and in high yield, since 

tryptophan-containing protein sequences are prematurely terminated in 
synthesis due to lack of utilizable tryptophan. Indeed, an effective 
relief of attenuation by this approach is entirely dependent on severe 
tryptophan starvation. 

25. The present invention addresses problems associated with tryptophan 
repression and attenuation in a different manner and provides (1) a 
method for obtaining an expression vehicle designed for direct 
expression of heterologous genes from the trp promoter-operator, (2)' 
methods for obtaining vehicles designed for expression, from the 

30 tryptophan operator-promoter, of specifically cleavable polypeptides 
coded by homologous-heterologous gene fusions and (3) a method of 
expressing heterologous polypeptides controllably, efficiently and in 
high yield, as well as the associated means. 
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SUMMARY OF THE INVENTION 



According to the present invention, novel plasmidic expression 
vehicles are provided for the production in bacteria of heterologous 
polypeptide products, the vehicles having a sequence of double-stranded 

5 DMA comprising, in phase from a first 5' to a second 3* end of the 
coding strand, a trp promoter-operator, nucleotides coding for the trp 
leader ribosome binding site, and nucleotides encoding translation 
initiation for expression of a structural gene that encodes the amino 
acid sequence of the heterologous polypeptide* The DNA sequence referred 

10 to- contains neither a trp attenuator region nor nucleotides coding for 
the trp E ribosome binding site. Instead, the trp leader ribosome 
binding site is efficiently used to effect expression of the information 
encoded by an inserted gene. 

Cells are transformed by addition of the trp promoter-operator-' 

15 containing and attenuator-lacking plasmids of the invention and grown up 
in the presence of additive tryptophan- The use of tryptophan-rich 
media provides sufficient tryptophan to essentially completely repress 
the trp promoter-operator through trp/repressor interactions, so that 
cell growth can proceed uninhibited by premature expression of large 

20 quantities of heterologous polypeptide encoded by an insert otherwise 
under the control of the trp promoter-operator system. When the 
recombinant culture has been grown to the levels appropriate for 
industrial production of the polypeptide, on the other hand, the 
external source of tryptophan is removed, leaving the cell to rely only 

25 on the tryptophan that it can itself produce- The result is mild 

tryptophan limitation and, accordingly, the pathway is derepressed and 
highly efficient expression of the heterologous insert occurs, 
unhampered by attenuation because the attenuator region has been deleted 
from the system. In this manner the cells are never severely deprived 

30 of tryptophan and all proteins, whether they contain tryptophan or not, 
can be produced in substantial yields* 

The invention further provides means of cleaving double-stranded 
0^'A at any desired point, even abse^^t a restriction enzyme site, a 
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technique- useful in, dnong other things, the creation of trp operons 
having attenuator deletions other than those previously obtained by 
selection of mutants. 

Finally, the invention provides a variety of useful intermediates 
5 and endproducts, including specifically cleavable heterologous- 

hoinologous fusion proteins that are stabilized against degradation under 
expression conditions. 

The manner in which these and other objects and advantages of the 
invention are obtained will become more apparent from the detailed 
10 description which follows and from the accompanying drawings in which: 

Figures 1 and 2 illustrate a preferred scheme for forming plasmids 
capable of expressing heterologous genes as fusions with a 
portion of the trp D polypeptide, from which fusion they may 
be later cleaved; 

15 Figure 3 is the result of polyacrylamide gel segregation of cell 

protein containing homologous (trp D') - heterologous 
(somatostatin or thymosin a 1) fusion proteins; 
Figures 4, 5 and 5 illustrate successive stages in a preferred 
scheme for the creation of a plasmid capable of directly 

20 expressing a heterologous gene (human growth hormone) under 

the control of the trp promoter-operator system; 
'*igure 7 is the result of polyacrylamide gel segregation of cell 
protein containing human growth hormone directly expressed 
under the control of the trp promoter-operator system; 

25 Figures 8,9 (d-b) and 10 illustrate in successive stages a 

preferred scherne for the creation of plasmids capable of 
expressing heterologous genes (in the illustrated case, for 
somatostatin) as fusions with a portion of the trp E 
polypeptide, fro.-n v/hich fusions they may be later cleaved; 

30 F'ic'ure 11 is the result of polyacrylamide gel segregation of cell 

prote'^'n cont.^ining homologous (trp E) - heterologous fusion 
proteins for the production of, respec t^"vely, somatostatin, 
thymosin aicha 1, human proinsulin, ard the A and B chains of 
human 'irisul in. 
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Figures 12 and 13 illustrate in successive stages the manner in 
which the plasmid created by the scheme of Figures 3-10 
inclusive is manipulated to form a system in which other 
heterologous genes may be interchangeably expressed- as fusions 
5 with trp E polypeptide sequences. 

In the Figures, only the coding strand of the double-stranded 
plasmid and linear DNAs are depicted in most instance^, for clarity in 
illustration. Antibiotic resistance-encoding genes are denoted ap' 
(ampicillin) and tc*^ (tetracycline). The legend tc^ connotes a gene 

10 for tetracycline resistance that is not under the control of a 

promoter-operator system, such that plasmids containing the gene will 
•nevertheless be tetracycline sensitive. The legend "ap " connotes 
ampicillin sensitivity resulting from deletion of a portion of the gene 
encoding ampicillin sensitivity. Plasmidic promoters and operators are 

15 denoted "p" and "o". The- letters A, T, G and C respectively connote the 
nucleotides containing the bases adenine, thymine, guanine and 
cytosine. Other Figure legends appear from the text. 

The preferred embodiments of the invention described below involved 
use of a number of commonly available restriction endonucleases next 
20 identified, with their corresponding. recognition sequences and 
(indicated by arrow) cleavage patterns. ■ 

Xbal: 



EcoRI: 

25 

Bglll: 
PvuII 
30 BamHI: 



CTA6A 
AGATCjT 

GAATTC 

CTTAAG 
t 

AGATCT 

TCTAGA 
t 

GAGCTG 

6TCGAC 
t 

i 

GGATCC 

CCTAGG 
t 



TaqI : 



Hindi II: 



Hpal: 



Pstl. 



) ■ 

TCGA 

AGCT 
t 

AAGCTT 

TTCGAA 
t 

i 

GTTAAC 

CAATTG 
t 

CTGCAG 

GACGTC 
t 
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Where the points of cleavage are spaced apart on the respective strands 
the cleaved ends will be "sticky", ie, capable of reannealing or of 
annealing to other complementarily "sticky"-ended DNA by Watson-Crick 
base pairing (A to T and G to C) in mortise and tenon fashion. Some 
5 restriction enzymes, such as Hpal and PvuII above, cleave to leave 
"blunt" ends. The nucleotide sequences above are represented in 
accordance with the convention used^throughout: upper strand is the 
protein encoding strand, and in proceeding from left to right on that 
strand one moves from the 5' to the 3' end thereof, ie, in the direction 
10 of transcription from a "proximal" toward a "distal" point. 

Finally with regard to conventions, the symbol "a" connotes a 
deletion. Thus, for example, reference to a plasmid followed by, say, 
"AEcoRI-Xbal" describes the plasmid from which the nucleotide sequence 
between EcoRI and Xbal restriction enzyme sites has been removed by 
15 digestion with those enzymes. For convenience, certain deletions are 
denoted by number. Thus, beginning from the first base pair ("bp") of 
the EcoRI recognition site which precedes the gene for tetracycline 
resistance in the parental plasmid pBR322,- "a1" connotes deletion of 
bpl-30 {ie, AEcoRI-Hind III) and consequent disenabling of the 
20 tetracycline promoter-operator system; "a2" connotes deletion of bp 1-37S 
(ie, AEcoRI-BamHI) and consequent removal of both the tetracycline 
promoter-operator and the structural gene which encodes tetracycline 
resistance; and "a3" connotes deletion of bp 3611-4359 (ie, APstl-EcoRI) 
and elimination of ampicillin resistance. "a4" is used to connote 
25 removal of bp -900 --1500 from the trp operon fragment 5 (Fig. 1). 
eliminating the structural gene for the trp D polypeptide. 



DETAILED DESCRIPTION 

The trp leader sequence is made up of base pairs ("bp") 1-162, 
starting from the start point for trp mRNA. A fourteen amino acid 
jO putative trp leader polypeptide is encoded by bp 27-71 following the AIG 
• nucleotides which encode the translation start signal. The trp 
attenuator region comprises successive GC-rich and AT-rich sequences 
lying between bp 114 and 156 and attenuation is apparently effected on 
mSN'A nucleotides encoded by bp ~134-141 of the leader seos'ence. 
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express a heterologous polypeptide under the direction of the trp leader 
ribosome binding site and at the same time avert attenuation, the 
following criteria must be observed: 

1. Base pairs 134-141 or beyond must be deleted; 
5 2, The ATG codon of the inserted gene must be positioned in 

■ correct relation to a ribosome binding site, as is known (see, 
eg., J/A. Steitz "Genetic signals and nucleotide sequences in 
messenger RNA" in Biological Regulation and Control (ed. R. 
Goldberger) Plenum Press, N.Y, (1978). 
10 3» Where a homologous-heterologous fusion protein fs to be 

produced, the translation start signal of a homologous 
polypeptide sequence should remain available, and the codons 
for the homologous portion of the fusion protein have to be 
inserted in phase without intervening translation stop signals. 
15 For example, deleting all base pairs within the leader sequence 

distal from.bp* 70 removes the attenuator region, leaves the ATG 
sequence which encodes the translation start signal, and eliminates the 
intervening translation stop encoded by TCA (bp. 69-71), by eliminating 
A and following nucleotides. Such a deletion W9uld result in expression 
20 of a fusion protein beginning with the leader polypeptide, ending with 
that encoded by any heterologous insert, and including a distal region 
of one of the post-leader trp operon polypeptides determined by the 
extent of the deletion in the 3' direction. Thus a deletion extending 
into the*.E gene would lead to expression of a homologous precursor 
25 comprising the L sequence and the distal region of E (beyond the 
deletion endpoint) fused to the sequence encoded by any following 
insert, and so on. 

Two particularly useful plasmids from which the attenuator region 
has been deleted are the plasmids pGMl and pGM3, G.F. Miozzari et al, 
30 J> Bacteriology 133 , 1457 (1978). These respectively carry the 

deletions trp aLE 1413 and trp aLE 1417 and express (under the control 
of the trp promoter-operator) a polypeptide comprising approximately the 
first six amino acids of the trip leader and distal regions of the E 
polypeptide. In the most preferred case, pGMl, only about the last 
35 third of the E polypeptide is expressed whereas pGM3 expresses almost 
the distal one half of the E polypeptide codons. E_. coli K-12 strain 
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W3110 tna _2~trp"Al02 containing pGMl has been deposited with the 
American Type Culture Collection (ATCC no, 31522), pGMl may be 
conventionally removed from the strain for use in the procedures 
described below. 

5 Alternatively, deletions may be effected by means provided by the 

invention for specifically cleaving double-stranded DNA at any desired 
site- One example of this cleavage technique appears from Part IV of 
the experimental section, infra . Thus, double-stranded DNA is converted 
to single-stranded DNA in the region surrounding the intended cleavage 
10 point, as by reaction with lambda exonuclease. A synthetic or other 
single-stranded DNA primer is then hybridized to the single-stranded 
length earlier formed, by Watson-Crick base-pairing, the primer sequence 
being such as to ensure that the 5' end thereof will be coterminous with 
the nucleotide on the first strand just prior to the intended cleavage 
15 point. The primer is next extended in the 3' direction by reaction with 
ONA polymerase, recreating that portion of the original double-stranded 
DNA prior to the intended cleavage that was lost in the first step. 
Simultaneously or thereafter, the portion of the first strand beyond the 
intended cleavage point is digested away. To summarize, where "v" marks 
the intended cleavage point: 

;V intended cleavage point "v" 



20 



b) 



made single stranded 
around "v" 



^) ^ v primer hybridization 



d) 



extension from primer 



^) V single strand digestion 

30 
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In the most preferred embodiment, steps (d) and (e) are performed 
simultaneously, using a polymerase that simultaneously digests the 
protruding single stranded end in the 3V> 5' direction and extends the 
primer (in the presence of dATP, dGTP. dTTP and dCTP) in the 5' > 3* 

5 direction. The material preferred for this purpose is Klenow Polymerase 
I, ie, that fragment obtained by proteolytic cleavage of DNA Polymerase 
I which contains the 5' > 3* polymerizing activity and the 3* > 5' 

^ exonucleolytic activity of the parental enzyme, yet lacks its 5* » 3' 
exonucleolytic activity. A. Kornberg, DNA Synthesis , 98, W.H. Freeman 

10 and Co., SFO (1974). 

Using the procedure just described, attenuator deletions may be 
made in any desired manner in a trp operon-containing plasmid first 
linearized by, eg, cleavage at a restriction site downstream from the 
point at which the molecule is to be blunt-ended ("v" above). 
15 Recircularization following deletion of the attenuator region may be 
effected, eg, by blunt end ligation or other manners which will be 
apparent to the art-skilled.. 

Although the invention encompasses direct expression of 
heterologous polypeptide under the direction of the trp promoter- 

20 operator, the preferred case involves expression of fused proteins 
containing both homologous and heterologous sequences, the latter 
preferably being specifically cleavable from the former in 
extra^cellular environs. Particularly preferred are fusions in which 
the homologous portion comprises one or more amino acids of the trp. 

25 leader polypeptide and about one-third or more of the trp E amino acid 
sequence (distal end). Fusion proteins so obtained appear remarkably 
stabilized against degradation under expression conditions. ' . 

Bacteria coJM K-12 strain W3110 tna rtrp"Al02 (pGMl)., ATCC 
No. 31622, rriay be used to aiTiplify stocks of the pGMl plasmid preferably 
e.-npioved U) constructing the attenuator-deficient trp promoter-operator 
30 systerr^s of ihe invention. This strain is phenotyp ical ly trp in the 
presence of anthranilate and can be grew--: in niininial media sucli as LB 
!t::nen':.ed sJto SO uo/ml anthrani lati?. ■ - 
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All bacterial strains used in trp promoter-operator directed 
expression according to. the invention are trp repressor^ ("trp R^") 
as in the case of wild-type £. coli , so as to ensure repression until 
heterologous expression is intended. 

DNA recombination is, in the preferred embodiment, performed in 
E. CQli , K-12 strain -294 (end A, thi". hsr". hsmj^). ATCC No. * 
31446. a bacterial strain whose membrane characteristics facilitate 
transformations. Heterologous polypeptide-producing plasmids grown in 
strain 294 are conventionally extracted and maintained in solution (eg, 
lOmM tris, lm.M EDTA.pHS) at from about -20*C to about A'C. For 
expression under industrial conditions, on the other hand, we prefer a 
more hardy strain, ie, E. coli K-12 x"F" RV 308 str'^, gal 308^ 
ATCC No. 31608. RV 308 is nutritionally wild-type and grows well in 
minimal media, synthesizing all necessary macromolecules from 
conventional mixes of ammonium, phosphate and magnesium salts, trace 
metals and glucose. After transformation of RV 308 culture with strain 
294-derived plasmid the culture is plated on media selective for a 
marker (such as antibiotic resistance) carried by the plasmid, and a 
transformant colony picked and grown in flask culture. Aliquots of the 
latter in 10% DMSO or glycerol solution (in sterile Wheaton vials) are 
shell frozen in an ethanol-dry ice bath and frozen at -SO^C. To produce 
the encoded heterologous polypeptide the culture samples are grown up in 
niedia containing tryptophan so as to repress the trp promoter-operator 
and .the system then deprived of additive tryptophan to occasion 
expression. 

For the first stage of growth one may employ, for example, LB 
medium (J.H. Miller, Experiruents in Molecular Genetics, 433, Cold Spring 
Harbor Laboratory 1972) which contains, per liter aqueous solution, lOg 
Bacto tryptone» 5g Bacto yeast extract and lOg NaCL Preferably, the 
inoculant is grown to optical density ("o.d.") of 10 or more (at 550 
^^M), fTiore preferably to o.d. 20 or more, and most preferably to o,d. 30 
or rnore, albeit to less than stationary phase. 
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For derepression and expression the inoculant is next grown under 
conditions which deprive the cell of additive tryptophan. One 
appropriate media for such growth is M9 {J,H, Miller, supra at 431) 
prepared as follows (per liter): 

5 KH2P0^ . 3g 

Na2HP04 6g 
NaCl 0.5g ' 

NH^Cl Ig 

Autoclave, then -add: *. 

10 10 ml O.OIM CaClg 

1 ml IM MgSO^ . - , . 

10 ml 20 Z glucose 

Vitamin 81 lyg/ml 

Humkq hycase amino 
15 or DIFCO cas, amino acids 40 ug/ml. 

The amino acid supplement is a tryptophan-! acking acid hydrqlysate of 
casein. 

To commence expression of the heterologous polypeptide the 
inoculafit grown in tryptophan-rich media may, eg, be diluted into a - 
larger volume of medium containing no additive tryptophan (for example, 
2-10 fold dilution) grown up to any desired level (preferably short of 
stationary growth phase) and the intended product conventionally 
obtained by lysis, centrifugation and purification. In the 
tryptophan-deprived growth stage, the cells are preferably grown to od 
25 in excess of 10, more preferably in excess of od 20 and most preferably 
to or beyond od 30 (all at 550 nfi) before product recovery. 



20 



All DNA recombination experiments described in the Experimental 
section which follows were conducted at Genentech Inc. in accordance 
with the National Institutes of Health Guidelines for Recombinant DNA 
30 ^'^^search. 
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A preferred method of expressing fusion proteins comprising desired 
polypeptides and, fused thereto, a portion of the amino acid sequence of 
the trp D polypeptide that is separable in vitro by virtue of a 
5 methionine amino acid specifically sensitive to CNBr cleavage, is 
described with reference to Figures 1-3. 

^ ' A. Construction of pBRHtrp 

Plasmid pGMl {I, Fig. 1) carries the E. coli tryptophan operon 
containing the deletion ALE1413 (G.F. Miozzari, et al^, (1978) 
10 BacterioloQv 1457-1466)) and hence expresses a fusion protein comprising 
the first 6 amino acids of the trp leader and approximately the last 
third of the trp E polypeptide (hereinafter referred to in conjunction 
as LE'), as well as the trp D polypeptide in its entirety, all under the 
control of the trp promoter-operator system. The plasmid, 20 yg, was 

15 digested with the restriction enzyme PvuII which cleaves the plasmid at 
five sites. The gene fragments 2 were next combined with EcoRI linkers 
(consisting of a self complementary oligonucleotide 2 the sequence: 
pCATGAATTCATG) providing an EcoRI cleavage site for a later cloning into 
a plasmid containing an EcoRI site (20). The 20 ug of DNA fragments 2 

20 obtained from pGMl were treated with 10 units T^.DNA ligase in the 
presence of 200 pico m9les of the 5'-phosphorylated synthetic 
oligonucleotide pCATGAATTCATG {3} and in 20ul T^ ONA ligase buffer 
(20niM tris, pH 7,6, 0,5 mM ATP, 10 mM MgCU, 5 mM dithiothreitol ) at 
4 C overnight. The solution was then heated 10 minutes at 70'C to halt 

25 ligation. The linkers were cleaved by EcoRI digestion and the 
fragments, now with EcoRI ends were separated using 5 percent 
polyacrylamide gel electrophoresis (herein after "PAGE") and the three 
largest fragments isolated from the gel by first staining with ethidium 
bromide, locating the fragments with ultraviolet light, and cutting from 

30 the gel the portions of interest. Each gel fragment, with*300 
microliters O.lxTBE, was placed in a dialysis bag and subjected to 
electrophoresis at 100 v for one hour in CIxTBE buffer (TSE buffer 
^cop.teinc: iQ.s grn tris base, 5.5 gm boric .-.cid, 0.09 gm- Na^EDTA in I 
1 Uer H^O). The aqueous solution was collected from the dialysis 
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bag, phenol extracted, chloroform extracted and made 0.2 M sodium 
chloride, and the DNA recovered in water after ethanol 
precipitation. [All DNA fragment isolations hereinafter described, 
are performed using PAGE followed by the electroelution method just 
5 discussed]. The trp promoter-operator-containing gene with EcoRI 
sticky ends 5^ was identified in the procedure next described, which 
entails the insertion of fragments into a tetracycline sensitive 
plasmid 6_ which, upon promoter-operator insertion, becomes ^ 
tetracycline resistant. 

10 B. Creation of the plasmid pBRHtrp expressing tetracycline 
resistance under the control of the trp promoter-operator and 
identification and amplification of the trp promoter-operator 
containing DNA fragment 5^ isolated in (A.) above. 

Plasmid pBRHl (^), (R.I. Rodriguez, et a^.* Nucleic Acids 
15 Research ^, 3257-3287 [1979]) expresses ampicilin resistance and 
contains the gene for tetracycline resistance but, there being no 
associated promoter, does not express that resistance. The plasmid 
is accordingly tetracycline sensitive. By introducing a 
promoter-operator system in the EcoRI site, the plasmid can be made 
20 tetracycline resistant. 

pBRHl was digested with EcoRI and the enzyme removed by phenol 
extraction followed by chloroform extraction and recovered in water 
after ethanol precipitation. The resulting DNA molecule 1_ was, in 
separate reaction mixtures, combined with each of the three DNA 

25 fragments obtained in part A, above and ligated with T^ DNA ligase 
as previously described. The ONA present in the reaction mixture was 
used to transform competent E^, col i K-12 strain 294, K. Backman ^ 
al,, Proc NatM Acad Sci USA 21> 4174-4198 [1976]) (ATCC no. 31448) 
by standard techniques (V. Hershf ield e_t aj_« , Proc Nat'l Acad Sci USA 

30 7_1, 3455-3459 [1974]) and the bacteria plated on LB plates containing 
20 ug/ml ampicillin and 5 ug/ml tetracycline. Several 
tetracycl ine-resistant colonies were selected, plasmid DNA isolated 
and the presence of the desired fragment confirmed by restriction 
enzyme analysis. Ihe resulting plasirid 8, designated pBRHtrp, 
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expresses B-lactamase, imparting ampicillin resistance, and it 
contains a DNA fragment including the trp promoter-operator and 
encoding a first protein comprising a fusion of the first six amino 
acids of the trp leader and approximately the last third of the trp E 
5 polypeptide (this polypeptide is designated LE*), and a second 
protein corresponding to approximately the first half of the trp D 
polypeptide (this polypeptide is designated D'), and a third protein 
coded for by the tetracycline resistance gene* 

C. Cloning genes for various end-product polypeptides and expression 
10 of these as fusion proteins comprising end-product and specifically 
cleavable trp D polypeptide precursor (Figure 2). 

A DNA fragment comprising the trp promoter-operator and codons 
for the LE' and D' polypeptides was obtained from plasmid pBRHtrp and 
inserted into plasmids containing structural genes for various 
15 desired polypeptides, next- exempl if ied for the case of somatostatin 
(Figure 2)- 

pBRH trp was digested with EcoRI restriction enzyme and the 
resulting fragment 5_ isolated by PAGE and electroelution. 
EcoRI-digested plasmid pSom 11 (K. Itakura et al. Science 198 , 1055 

20 (1977); G.B. patent publication no. 2 007 676 A) was combined with 
fragment 5_. The mixture was ligated with DNA ligase as 
previously described and the resulting DNA transfonned into £. col i 
K-12 strain 294 as previously described. Transfonnant bacteria were 
selected on ampici 11 in-containing plates. Resulting 

2S ampicill in-resistant colonies were screened by colony hybridization 
(M. Gruenstein et a^. , Proc Nat'l Acad Sci USA _72', 3951-3965 [1975]) 
using as; a probe the trp promoter-operator-containing fragment S 
'So'icted from pBRHtrp, which had been radioact ively labelled with 
Several colonies shown positive by colony hybridization were 

30 ^-'-/j:^•:t,ed, plasmid DNA was isolated and the orientation of the 
i '■■;>• i:; tec- fragments determined by restriction analysis ennploying 
^'v-'' = ■ ";Ct ion enzymes Bglll and BamHl in dcub'ie c-ipestion. E_, col i 29^ 
cori'-^ ning the plasmid designated pS0M7i2, ]\, w:-icr. has the trp 
: ■ er-operator fragment in the desired ri:"-:- . i^*;- -v.-.s grown in L5 
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medium containing 10 ug/ml ampicillin. The cells were grown to 
optical density 1 (at 550 nM), collected by centrifugat'ion and 
resuspended in M9 media in tenfold dilution. Cells were grown for 
2-3 hours, again to optical density 1, then lysed and total cellular 

5 protein analyzed by SOS (sodium dodcyl sulfate) urea (15 percent) 
polyacrylamide gel electrophoresis (J.V. Maizel Jr. ejt a1,, Meth 

• Viral _5, 180-246 [1971]). 

Figure. 3 illustrates a protein gel analysis in which total 
protein from various cultures is separated by size. The density of 

10 individual bands reflects the quantity in whicrt the respective 

proteins are present. With reference to Figure 3, lanes 1 and 7 are 
controls and comprise a variety of proteins of previously determined 
size which serve as points of comparative reference. Lanes 2 and 3 
segregate cellular protein from colonies of IE. coli 294 transformed 

15 with plasmid pSom7 a2 and respectively grown in LB (lane 2) and M9 
(lane 3) media. Lanes 4 and 5 segregate cellular protein obtained 
from similar cells transformed with the plasmid pTha7 a2, a thymosin 
expression plasmid obtained by procedures essentially identical to 
those already described, beginning with the plasmid' pThal (see the 

20 commonly assigned US patent application of Roberto Crea and Ronald B, 
Wetzel, filed February 28, 1980 for Thymosin Alpha 1 Production, the 
disclosure of which 1s incorporated herein by reference). Lane 4 
segregates cellular protein f rem E_. coli 294/pTha7 a2 grown in LB 
media, whereas lane 5 segregates cell protein from the same- 

25 transformant grown in M9 media. Lane 6, another control, is the 
protein pattern of E. coli 294/pBR322 grown in LB. 

Comparison to controls shows the uppermost of the two most 
prominent bands in each of lanes 3 and 5 to be proteins of size 
anticipated in the case of expression of a fusion protein comprising 
30 the 0' polypeptide and, respectively, somatostatin and thymosin (the 
other prominent band represents the LE* polypeptide resulting from 
deletion of the attenuator). Figure 3 confirms that expression is 
■^'epressed in tryp tophan-r ich media, but Gr::repressed under tryptophan 
deficient conditions. 
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D- Cyanogen bromide cleavage and radioimmunoassay for hormone product 

For both the thymosin and somatostatin cases, total cellular 
protein was cyanogen bromide-cleaved, the cleavage product recovered 
• and, after drying, was resuspended in buffer and analyzed by radio- 
5 immunoassay, confirming the expression of product immunologically 
identical, respectively, to somatostatin andr-thymosin. Cyanogen 
bromide cleavage was as described in D,V, Goeddel et jil^. , Proc Nat'l 
•Acad Sci USA 76, 106-110 [1979])- 

•11. Construction of plasmids for direct expression of heterologous 
10 genes under control of the trp promoter-operator system 

The strategy for direct expression entailed creation of a 
plasmid containing a unique restriction site distal from all control 
elements of the trp operon into which heterologous genes could be 
cloned in lieu of the trp leader sequence and in proper, spaced 
15 relation to the trp leader pol^^pept ide' s ribosome binding site. The 
' direct expression approach is next exemplified for the case of human 
growth honnone expression. 

The plasmid pSom7 a2, lOpg, was cleaved w-ith EcoRI and the DMA 
fragment 5_ containing the tryptophan genetic elements was isolated by 

20 PAGE and electroelution. This fragment, 2ug, was digested with the 
restriction endonuclease Taq I, 2 units, 10 minutes at 37*C such 
that, on the average, only one of. the approximately five Taq I sites 
in each molecule is cleaved. This partially digested mixture of 
fcegments was separated by PAGE and an approximately 300 base pair 

25 fragment \2_ (Fig. 4) that contained one EcoRI end and one Taq I end 
v-'cs isolated by electroelution. The corresponding Taq I site is 
located between the transcription start and translation start sites 
and is 5 nucleotides upstream from the ATG codon of the trp leader 
peptide. The DNA sequence about this site is shown in Figure 4. By 

30 proceeding as described, a fragment could be isolated, containing all 
control elements of the trp operon, i.e., promoter-operator system, 

transcription initiation sicnal, and trp leader ribosome binding 
S 1 e . 
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The Taq I residue at the 3' end. of the resulting fragment 
adjacent the translation start signal for the trp leader sequence was 
next converted into an Xbal site, as shown in Figure 5. This was 
done by ligating the fragment 12_ obtained above to a plasmid 

5. containing a unique (i.e., only one) EcoRI site and a unique Xbal 
site. For this purpose, one may employ essentially any plasmid 
containing; in order, a repHcon, a selectable marker such as " 
antibiotic resistance, and EcoRI, Xbal and BamHI sites. Thus, for 
example, an Xbal site^ can be introduced between the EcoRI and -BamHI 

10 sites of pBR322 (F. Bel ivar et al- , Gene 2, 95-119 [1977]) by, e.g., 
cleaving at the plasmid's unique Hind III site with Hind III followed 
by single strand-specific nuclease digestion of the resulting sticky 
ends, and blunt end ligation of a self annealing double-stranded 
synthetic nucleotide containing the recognition site such as 

15 CCTCTAGAGG. Alternatively, naturally derived DNA fragments may be^ 
employed, as was done in the present case, that contain a single Xbal 
site between EcoRI and BamHI cleavage residues. Thus, an EcoRI and 
BamHI digestion product of the viral genome of hepatitis B was 
obtained by conventional means and cloned into the EcoRI and BamHI 

20 sites of plasmid pGH6 (D.V. Goeddel ejt _a2. , Nature 281^, 544 [1979])) 
to form the plasmid pHS32. Plasmid pHS32 was cleaved with Xbal, 
phenol . extracted, chlorofonn extracted and ethanol precipitated. It 
was theii*. treated with 1 ul E. coli polymerase I, Klenow fragment 
(Boehrjnger-Mannheim) in 30 ul polymerase buffer (50 mM potassium 

25 phosphate pH 7.4, 7rTiM MgCl^, 1 mM B-mercaptoethanol ) containing 

OamM dTTP and 0,lmM dCTP for 30 minutes at O'C then 2 hr. at 37^C, 
This treatment causes 2 of the 4 nucleotides complementary to the 5' 
protruding end of the Xbal cleavage site to be filled in: 

5« CTAGA 5' CTAGA 

30 3' T ^ 3' TCT 

T'aO nucleotides, dC and dT, were incorporeted giving an end with two 

protruding nucleotides. This linear residue of plasmid pHS32 
(aftr^r phenol anrf chloroform extraction and recovery in water after 
cthano.l precipi Lction) was cleaved witri EcoRI. The large plasmid 
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fragment _13 was separated from the smaller EcoRI-Xbal fragment by 
PAGE and isolated after electroelution. This DNA fragment from pHS32 
{0*2 vg), was Ugated, under conditions similar to those described 
above, to the EcoRI-Taq I fragment of the tryptophan operon (-0.01 
wg)* as shown in Figure 5. In this process the Taq I protruding end 
is ligated to the Xbal remaining -protruding end even though it is not 
conpletely Watson-Crick base-paired: 

T ^ CTAGA ^TCTAGA 

' AGC TCT ^ AGCTCT 



10 A portion of this ligation reaction mixture was transformed into E. 
coli 294 cells as in part I. above, heat treated and plated on LB 
plates containing ampicillin. Twenty-four colonies were selected, 
grown in 3 ml LB media, and plasmid isolated. Six of these were 
found to have the Xbal site regenerated via E. coli catalyzed DNA 

15 repair and replication: 



-TCTAGA - ^TCTAGA- 



20 



30 



AGCTCT AGATCT 

T"hese plasmids were also found to cleave both with EcoRI and Hpal and 
to give the expected restriction fragments. One plasmid j^, desig- 
nated pTrp 14, was used for expression of heterologous polypeptides, 
as next discussed. 

The plasmid pHGH 107 (18 in Figure 6, D.V. Goeddel et al. Nature , 
181_, 544, 1979) contains a gene for human growth hormone made up of 
23 cmino acid codons produced from synthetic DNA fragments and 163 
amino acid codons obtained from complementary DNA produced via 
reverse tpcnscript ion of human growth hormone messenger RNA. This 
gene though it lacks the codons of the "pre" sequence of human 
growth hormone, does contain an ATG translation initiation codon. 
^he gene was isolated from 10 ug pHGH 107 after treatment with EcoRI 
followed by E. coli polymerase I Klencw fragment and dTTP and dATP as 
described above. Following phenol and chloroform extraction and 
ethanoi p^'ecipitation the plasn-rid was \."ecXe6 vn'th BamHi. See Figure 
6. 
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The human growth hormone ("HGH") gene-containing fragment 21 was 
isolated by PAGE followed by electroelution. The resulting DNA 
fragment also contains the first 350 nucleotides of the tetracycline 
resistance structural gene, but lacks the tetracyline 

5 promoter-operator system so that, when subsequently cloned into an 
expression plasmid, plasmids containing the insert can be located by 
the restoration of tetracycline resistance. Because the EcoRI end of 
the fragment 21 has been i^illed in by the Klenow polymerase I- 
procedure, the fragment has one blunt and one sticky end, ensuring 

10 proper orientation when later inserted into an expression plasmid. 
See Figure 6. 

The expression plasmid pTrpl4 was next prepared to receive the 
HGH gene-containing fragment prepared above. Thus, pTrpl4 was Xfaal 
digested and the resulting sticky ends filled in with the Klenow 

15 polymerase I procedure employing dATP, dTTP, dGTP and dCTP. After 
phenol and chloroform extraction and ethanol precipitation the 
resulting DNA 16^ was treated with BamHI and the resulting large 
plasmid fragment r? isolated by PAGE and electroelution. The 
pTrpl4-derived fragment 17_ had one blunt and one sticky end, 

20 permitting recombination in proper orientation with the HGH gene 
containing fragment 2\_ previously described. 

The HGH gene fragment 21_ and the pTrpl4 AXba-BamHI fragment 17 
were combined and ligated together under conditions similar to those 
described above. The filled in Xbal and EcoRI ends ligated together 
■^5 fay blunt end ligation to recreate both the Xbal and the EcoRI site: 



Anal fiileo in EcoRI filled in HGH gene initiation 
-TCTAG ^ AMICTATG- ^TtlAGkATTCTATG 



TAAGATAC ^GATQ 



ITAA 



3ATAC- 



Xbcl EcoRI 
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This construction also recreates the tetracycline resistance gene. 
Since the plasmid pHGH 107 expresses tetracycline resistance from a 
promoter lying upstream from the HGH gene (the lac promoter), this 
construction 22, designated pHGH 207, permits expression of the gene 
5 for tetracycline resistance under the control of the tryptophan 

promoter-operator. Thus the ligation mixture was transformed into E. 
coli 294 and colonies selected on L3 plates containing 5 ug/ml 
•tetracycline, 

In order to confirm the direct expression of human growth 
10 hormone from plasmid pHGH 207, total cellular protein derived from 
E-coli 294/pHGH 207 that had been grown to optical density 1 in LB 
media containing 10 ug/ml ampicillin and diluted 1 to 10 into M9 
media, and grown again to optical density 1, was subjected to SDS gel 
electrophoresis as in the case of part I. above and compared to 
15 similar electrophoresis data obtained for human growth hormone as 
previously expressed by others (D.V. Goeddel et al, Nature , 281 , 544 
(1979)). Figure 7 is a photograph of the resulting, stained gel 
wherein: Lanes 1 and 7 contain protein markers of various known 
sizes; Lane 2 is a control that separates total cellular protein of 
*0 E. Coli strain 294 pBR322; Lane 3 segregates protein from E. Coli 
294/pHGH 107 grown in LB media; Lane 4 segregates protein from E. 
Coli 294/pHGH 107 grown in M9 media; Lane 5 segregates protein from 
E.^Coli 294/pHGH 207 grown in LB media; and Lane 6 segregates protein 
from E. Coli 294/pHGH 207 grown in M9. The dense band in Lane 6 is 
5 human growth hormone, as shown by comparison to the similar bands in 
Lanes 2-4. As predicted by the invention, the organism E, Coli 
294/pHGH 207 when grown in tryptophan-r ich LB media produces less 
hunian growth hormone by reason of tryptophan repressor/operator 
interact ionSj and when grown in M9 media produces considerably more 
HGH than E. Coli 294/pHGH 107 owing to the induction of the stronger 
tryptophan promoters-operator system vs_ the J_ac promoter-operator 
system in pHGH 107. 

^il. Creation of a general expression p'ltswAc for the direct 
oxnression of heterologous genes under c(. ntr-;-! of the tryptophan 
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The plasmid pHGH 207 created in the preceding section was next 
used to obtain a DNA fragment containing the control elements of the 
tryptophan operon (with the attenuator deleted) and to create a 
plasmid ''expression vector" suitable for the direct expression of 

5 . various structural gene inserts. The strategy for creation of the 
general expression plasmid involved removal of the tryptophan control 
region from pHGH 207 by EcoRI digestion and insertion in the 
EcoRI-digested plasmid pBRHl used in part I. supra. pBRHl, as 
previously noted, is an ampicillin resistant plasmid containing the 

10 tetracycline resistance gene but is tetracycline sensitive because of 
the absence of a suitable promoter-operator system. The resulting 
plasmid, pHKY 1, whose construction is more particularly described 
below and shown in Figure 8,- isboth ampicillin and. tetracycl ine 
resistant, contains the tryptophan promoter-operator system, lacks 

15 the tryptophan attenuator, and contains a unique Xbal site distal 
from the tryptophan promoter-operator. The tryptophan promoter- 
operator and unique Xbal site are bounded by EcoRI sites, such that 
the promoter-operator-Xbal-containing fragment can be removed for 
insertion in other structural gene-containing plasmids. 

20 Alternatively, heterologous structural genes may be inserted, either 
into the Xbal site or (after partial EcoRI digestion) into the EcoRI 
site distal from the tryptophan control region, in either case so as 
to come under the control of the tryptophan promoter-operator system. 

Plasmid pHGH 207 was EcoRI digested and the trp promoter 
25 containing EcoRI fragment 23_ recovered by PAGE followed by 
electroelution, 

Plasmid pBRHl was EcoRI digested and the cleaved ends treated 
with bacterial alkaline phosphatase ("BAP") (1 yg, in 50 mM tris pH 8 
end 10 mM MgCl^ for 30 min. at 65'C) to remove the phosphate groups 

30 on the protruding EcoRI ends. Excess bacterial alkaline phosphatase 
was removed by phenol extraction, chloroform extraction and ethanol 
precipitation. The resulting linear DNA 2ii because it lacks 
phosphates on the protruding ends thereof, will in ligation accept 
only inserts whose complementary sticky ends are phosphorylated but 

35 v/ill not itself rec i rcul ari ze, permitting more facile screening for 
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plasmids containing the inserts. The EcoRI fragment derived from 
pHGH 207 and the linear DNA obtained from pBRHl were combined in the 
presence of ligase as previously described and ligated. A 
portion of the resulting mixture was transformed into E. coli strain 
294 as previously described, plated on LB media containing 5 ug/ml of 
tetracycline, and 12 tetracycline resistant colonies selected. 
Plasfliid was isolated from each colony and examined for the presence 
of a DNA insert by restriction endonuclease analysis employing- EcoRI 
.and Xbal. One plasmid containing the insert was designated pHKYl. 



10 IV. Creation of a plasmid containing the tryptophan operon capable 
of expressing a specifically cleavable fusion protein comprising 6 
amino acids of the trp leader peptide and the last third of the trp E 
polypeptide (designated LE') and a heterologous structural gene 
product. 



15 



20 



The strategy for the creation of a LE' fusion protein expression 
plasmid entailed the following steps: 

a- Prevision of a gene fragment comprising codons for the 
distal region of the LE 'polypeptide- having Bgl 11 and EcoRI 
Sticky ends respectively at the 5' and at the 3' ends of the 
c&ding strand; 

b- Elimination of the codons from the distal region of the LE' 
gene fragment and those for the trp D gene from plasmid SOM 7 a2 
and insertion of the fragment formed in step 1, reconstituting 
the LE' codon sequence iinmediately upstream from 
that for the heterologous gene for somatostatin. 



1. 



With reference r,c Tigyre 9(a), plasmid pSom7 ^2 was Hind III 
digested followed by digestion with lambda exonucleese (a 5' to 
3's>onuc"leese) under conditions chosen so as to digest beyond the Bgl 
I! -ei^vriction site witnin the LE ' encoding region. 20 uQ of Hind 
n,-.:: -.gested pSom 7 t,Z was dissolved in h'jffer [20rT';-l glycine buffer, 
lrr,M MgCl^, Im''; S-n-crc^ptoethanQl j. The rssjiting mixture 
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was treated with 5 units of lambda exonuclease for 60 minutes at room 
temperature- The reaction mixture obtained was then phenol 
extracted, chloroform extracted and ethanol precipitated. 

In order ultimately to create an EcoRI residue at the distal end 
S* of the LE' gene fragment a primer '^^pCCTGTGCATGAT was synthesized 
by the improved phosphotriester method (R. Crea et jiX., Proc Nat'l 
Acad Sci USA 75, 5765 [1978]) and hybridized to the single stranded 
end of the LE' gene fragment resulting from lambda exonuclease 
digestion. The hybridization was performed as next described. 

10 20ug of the lambda exortuclease-treated Hind III digestion 

product of plasmid pSom7 a2 was dissolved in 20ul and combined 
with 6ul of a solution containing approximately 80 picomoles of the 
5'-phosphorylated oligonucleotide described above. The synthetic 
fragment was hybridized to the 3' end of the LE* coding sequence and 

15 the remaining single strand portion of the LE' fragment was filled in 
by the Klenow polymerase I procedure described above, using dATP, 
dTTP, dGTP and dCTP. 

The reaction mixture was heated to 50*C and let cool slowly to 
10 C, whereafter 4yl of Klenow enzyme were added. After IS minute 

20 room temperature incubation, followed by 30 minutes incubation at 
37 C, the reaction was stopped by the addition of 5yl of 0,25 molar 
EDTA. The reaction mixture was phenol extracted, chloroform 
extracted and ethancl precipitated. The DNA was subsequently cleaved 
with the restriction enzyme Bgl II. The fragments were separated by 

25 PAGE, An autoradiogram obtained from the gel revealed a 

P"1abelled fragment of the expected length of approximately 470 
bp, which was recovered by electroelution. As outlined, this 
fragment LE'{d) has a Bgl II and a blunt end coinciding with the 
beginning of the primer. 

^0 The plasmid pThal described in part ](C.) above carries a 

structural gene for thymosin alpha one cloned at its 5' coding strand 
end inro an EcoRI site and at its 3' r:nd into a BamHI site. As shown 
in Figure 9, ihe thymosin gene cont:;ir:s i 5g; !I site as well. 
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Plasmid pThal also contains a gene specifying ampicillin resistance. 
In order to create a plasmid capable of accepting the LE'(d) fragment 
prepared above, pTfial was EcoRI digested followed by Klenow 
polymerase I reaction with dTTP and dATP to blunt the EcoRI 
residues. Bgl II digestion of the resulting product created a linear 
DNA fragment 33 containing the gene for ampicillin resistance and, at 
Its opposite ends, a sticky Bgl II residue and a blunt end. The 
resulting product could be recircularized by reaction with the LE'(d)"^ 
fragment containing a Bgl II sticky end and a blunt end in the 
presence of ligase to form the plasmid pTrp24 (Fig. 9b). In 
doing so, an EcoRI site is recreated at the position where blunt end 
• ligation occurred. 

With reference to Figure 10, successive digestion of pTrp24 with 
Bgl II and EcoRI, followed by PAGE and electroelution yields a 

15 fragment having codons for the LE'(d) polypeptide with a Bgl II 
sticky end and an EcoRI sticky end adjacent its 3' coding terminus. 
The LE'(d) fragment 38 can be cloned into the Bgl II site of plasmid 
pSom7 a2 to form an LE' polypeptide/somatostatin fusion protein 
expressed under the control of the tryptophan promoter-operator, as 

20 shown in Figure 10. To do so requires (1) partial EcoRI digestion' of 
PSom7 a2 in order to cleave the EcoRI site distal to the tryptophan 
promoter-operator, as shown in Figure 10 and (2) proper choice of the 
primer sequence (Figure 9) in order to- properly maintain the codon 
reading frame, and to recreate an EcoRI cleavage site. 

2= Thus, 16 uQ plasmid pSom7 a2 was diluted into 200 yl of buffer 

containing 20 mf^ Tris, pH 7.5, 5 mM MgCl^, 0.02 NP40 detergent, 
lOu m MgCi 3r,d treated with 0.5 units EccRI. After 15 i^inutes at 
3/ the reaction mixture was phenol extracted, chloroform extracted 
-f-anol precipitated and subsequently digested with Bgl II. The 
resulting frag.xer.t 36 isolated by the PAGE procedure followed 
■•^..troe iuticn. This fragment contains the codons "LE'(p)" for 
JX7ir,3! end of the LE" polypeptide, ie, those upstream frci; the 
site. The fragment 2^ was nrrxt ligateC Lo the fr f.gme.-t 38 in 
^'Sencfr cr i,^ QMp, i-igese to fcrr^ t.h,^ pias'^iic rSnr? i,?,yi, which 
■■a'^5for:nat ;oi into coli si--in ;:9^, p;- . : : -j" v described. 
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efficiently produced a fusion protein consisting. of the fully 
reconstituted LE* polypeptide and somatostatin under the control of 
the tryptophan promoter-operator. The fusion protein, from which the 
- somatostatin may be specifically cleaved owing to the presence of a 
5 methionine at the 5' end of the somatostatin sequence was segregated 
by .SOS polyacryl amide gel electrophoresis as previously described. 
The fusion protein product is the most distinct band^pparent in Lane 
6 of Figure 11, discussed in greater detail in Part VI, infra. - 

V* Creation of an expression system for trp LE' polypeptide fusions 
10 wherein tetracycline resistance is placed under the control of the 
tryptophan promoter-operator. 

The strategy for creation of an expression vehicle capable of 
receiving a wide variety of heterologous polypeptide genes for 
expression as trp LE' fusion proteins under the control of the 
15 tryptophan operon entailed construction of a plasmid having the 
follov/ing characteristics: - , 

1. Tetracycline resistance which would be lost in the event of 
the promoter-operator system controlling the genes specifying 
such resistance was excised. 

20 2. Removing the promoter-operator system that controls 

tetracycline resistance, and recircul arizing by ligation to a 
heterologous gene and a tryptophan promoter-operator system in 
proper reading phase with reference thereto^ thus restoring 
tetracycline resistance and accordingly permitting 

25 identification of plasmids containing the heterologous gene 

insert. 

ir: 5.hort, and consistent with the nature of the intended inserts, the 
object was to create a linear piece of ONA having a Pst residue at 
'^ts 3' end and a Bgl II residue at its 5' end, bounding a gene 
30 caoable of specifying tetracycline resistance: wuen brought under the 
TO-^/trol of a promoter-operator syster?.. 
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Thus, with reference to figure 12, plasmid pBR322 was Hind III 
digested and the protruding Hind III ends in turn digested with SI 
nuclease. The SI nuclease digestion involved treatment of 10 ug of 
Hind Ill-cleaved pBR322 in 30 ul SI buffer (0.3 M NaCl, 1 mM ZnCl^, 
25 mM sodium acetate, pH 4.5) with 300 units SI nuclease for 30 
minutes at 15'C. The reaction was stopped by the additon of 1 jil of 
30 X SI nuclease stop solution (0.8M tris base,^0 mM EDTA). The 
mixture was phenol extracted, chloroform extracted and ethanol 
precipitated, then EcoRI digested as previously described and the 
large fragment 45 obtained by PAGE procedure followed by 
electroelution. The fragment obtained has a first EcoRI sticky end 
and a second, blunt end whose coding strand. begins with the 
nucleotide thymidine. As will be subsequently shown, the Sl-digested 
Hind III residue beginning with thymidine can be joined to a Klenow 
15 polymerase I-treated Bgl II residue so as to reconstitute the Bgl II 
restriction site upon ligation. 

Plasmid pSom7 a2, as prepared in Part I above, was Bgl II 
digested and the Bgl II sticky ends resulting made double stranded 
with the Klenow polymerase I procedure using all four deoxynucleotide 
triphosphates. EcoRI cleavage of the resulting product followed by 
PAGE and electroelution of the small fragment 42 yielded a linear 
piece of DNA containing the tryptophan promoter-operator and codons 
of the LE' "proximal" sequence upstream from the Bgl II site 
("LE'(p)"). The product had an EcoRI end and a blunt end resulting 
from filling in the Bgl II site. However, the Bgl II site is 
reconstituted by ligation of the blunt end of fragment 42 to the 
blunt end of fragment 45. Thus, the two fragments were ligated in 
the presence of DNA ligase to form the recircul ari zed plasmid 
pHKY 10 (see Figure 12) which was propagated by transformation into 
30 competent E. colM strain 294 cellr.. Tetracycline resistant cells 
bearing the recombinant plasmid pH.KY 10 were grown up, plasmid DNA 
extracted and digested in turn with Bgl II and Pst followed by 
'is-ifition by the PAGE procedure and electroelution of the large 
frcgnent, a linear piece of DNA having Pst end Bgl il sticky ends. 
? ihis D.N'A frz<;mtrt 49 contains the origin of replication and 



20 



25 
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subsequently proved useful as a first component in the construction 
of plasmids where both the genes coding for trp LE' polypeptide 
fusion proteins and the tst resistance gene are controlled by the trp 
promoter/operator. 

5. . Plasmid pSom? a2a4, as previously prepared in Part IV, could be 
manipulated to provide a second component for a system capable of 
receiving a wide variety of heterologous structural genes. With 
reference to Figure 13, the plasmid was subjected to partial EcoRI 
digestion (see Part IV) followed by Pst digestion and fragment 51 

10 containing the trp promoter/operator was isolated by the PAGE 

procedure followed by electroelution. Partial tcoRI digestion was 
necessary to obtain a fragment which was cleaved adjacent to the 5' 
end of the somatostatin gene but not cleaved at the EcoRI site 
present between the ampicillin resistance gene and the trp promoter 

15 operator. Ampicillin resistance lost by the Pst I cut in the ap"^ 
gene could be restored upon ligation with fragment 51. 

In a first demonstration the third component, a structural gene 
for thymosin alpha-one was obtained by EcoRI and BamHI digestion of 
plasmid pThal, The fragment, 52^, was purified by PAGE and 
20 electroelution. 

The three gene fra.gments 49, 51 and 52 could now be ligated 
together in proper orientation, as depicted in Figure 13, to form the 
plasmid pTha7ali4, which could be selected by reason of the 
restoration of ampicillin and tetracycline resistance. The plasmid, 
25 when transformed into E. coli strain 294 and grown up under ■ 
conditions li'<e those described in Part I, expressed a trp LE' 
polypeptide fusion protein from which thymosin alpha one could be 
specifically cleaved by cyancgen bromide treatment. When other 
heterologous structural genes having EcoRI and BamHI termini were 
sim-iu-rly licatc-d with the pHKYlO-deri ved and pS0M7 A2/i4-deri ved 
couiponents, trp LE' polypeptide fusion proteins containing the 
po iypeptides for which those h-rtero looous genes code were likewise 
eif icienr. ly obtained. Figure 11 iliustreies ... SDS po lyacryl amide 
csl electrophoresis sepa-aticn of total c5"iK;i..-r protein from E. coli 
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strain 294 transformants. the darkest band in each case representing 
the fusion protein product produced under control of the tryptophan 
promoter-operator system. In Figure 11, Lane 1 is a control which 
segregates total cellular protein from E. coH 294/pBR322. Lane 2 
contains the somatostatin fusion product from plasmid pSom7 a2a4 
prepared in Part IV. Lane 3 is the somatostatin-containing 
expression product of ftSom? a1a4. Lane 4 contains the expression 
product of pTha7AU4, whereas Lane 5 contains the product expressed 
from aplasmid obtained when the pHKY-lO-deri ved and pSom7 " 
A2A4-derived fragments discussed. above were ligated with an 
EcoRI/BamHI terminated structural gene encoding human proinsulin and 
prepared in part by certain of us. Lanes 6 and 7 respectively 
contain, as the darkest band, a trp LE' polypeptide fusion protein 
from which can be cleaved the B and A chain of human insulin. The 
insulin B and A structural genes were obtained by EcoRI and BamHI 
digestion of plasmids pIBl and pIAll respectively, whose construction 
IS disclosed in D.V. Goeddel et al., Proc Kaf 1 Acad Sci irSA 7fi 106 
[1979]. Lane 8 contains size markers, as before. 



* * * 



^'^'^^ ^-^^ invention in its most preferred embodiment is 
20 described with reference to E. coH, other enterobacteriaceae could 
likewise serve as host cells for expression and as sources for trp 
operons, among which may be mentioned as examples Salmonella 
^^nhimv^ end Se.-ratia marcesans . Thus, the invention is not to be 
yrnned to the preferred embodiments described, but only by the 
-^5 iiiv.ful scope of the appended claims. 
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CLAIMS: 

1- A method of creating an expression plasmid for the 
expression of a heterologous gene which comprises the 
simultaneous ligation, in phase^ of : 

(a) a first linear double-stranded DNA fragment 
containing a replicon and a gene which expresses a 
"feielectable characteristic when placed under the 
direction of a bacterial promoter^ said fragment 
lacking any such promoter; 

(b) a second linear double-stranded DNA fragment 
comprising said heterologous gene; and 

(c) a third double-stranded DNA fragment which comprises 
a bacterial promoter; 

the ligatable ends of said fragments being configured such 
that upon ligation to form a replicable plasmid both the gene 
for the selectable characteristic and the heterologous gene 
come under the direction of the promoter, thus permitting use 
of the selectable characteristic in selection of transformant 
bacteria colonies capable of expressing the heterologous gene. 

2- The jrethod of claim 1 wherein the selectable 
characteristic is antibiotic resistance, 

3. The jTiethod of claim 2 wherein the selectable 
characteristic is tetracycline resistance and wherein the 
bacterial promoter is the trp promoter. 

The iTietnod of clain3 3 wherein ligation reconstitutes an 
•^^rx-^ron for the Gxpressicn of ampicilJin resistance as well, 

■ A Ji^ethcG of cleaving double stranded DNA at any given 
po:r.:i v;l:icr. cojn;: rises: 

^-:} ::c,f- ver::j;iC the double stranced DNA to sinqle- 

sr-rcndec DNA jn a region sur ::c\:r)d ing aaid point; 
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(b) hybridizing to the single-stranded region formed in 
step (a) a complementary primer length of single- 
stranded Dm, the 5' end of the primer lying 
opposite the nucleotide adjoining the intended 
cleavage site; 

(c) restoring that portion of the second strand • 

^ .eliminated in step (a) which lies in the 3' direction 
from said primer by reaction with DNA polymerase in 
the presence of adenine, thymine, guanine and 
cytosine-containing deoxynucleotide triphosphates; 
and 

(d) digesting the remaining single-stranded length of 
DNA which protrudes beyond the intended cleavage 
point. 

6- The method of claim 5 wherein steps (c) and (d) are 
performed simultaneously by reaction with Dm polymerase which 
polymerizes in the direction of 5' 3', is exonucleolytic in the 
direction of 3 * -> 5 ' , but non-exOTuclGoly tic in the direction of 5 ' 3 ' . 

"7- The method of claim 6 v;herein the polymerase is Klenow 
Polyrnerase I. 

^* A plasmidic expression vehicle for the production in 
E. coli bacteria of a heterologous polypeptide product, said 
venicle having a sequence of double-stranded DNA comprising, 
in phase from a first 5 ^ to a second 3' end of the coding 
strand t-hereof, the elements: 

(i.) a bacterial trp promoter-operator system; 
(xi) nucleotides coding for a ribosonie binding site for 
translation of element (iv); 
(:^:^i;* nucleotides coding for a translation start signal 
for translation of ele.T:ent (iv); and 
(•IV) a structural gene encoding the amino acid sequence 
Oi a heterologous polvoept ice : 
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said sequence. comprising neither any trp attenuation 
capability nor nucleotides coding for the trp E ribosome 
binding site. 

9- The method of producing a polypeptide product by the 
expression in bacteria of a structural gene coding therefor 
which comprises: ' 

(a) providing a bacterial inoculant transformed with a 
replicable plasmidic expression vehicle having a 
sequence of double-stranded DNA comprising, in 
phase from a first 5 ' to a second 3' end of the 
coding strand thereof, the elements: 

(i) a bacterial trp promoter-operator system; 
(ii) nucleotides coding for a ribosome binding 
site for translation of element (iv) ; 
(iii) nucleotides coding for a translation start 
signal for translation of element (iv); and 
(iv) a structural . gene encoding the amino acid 
sequence of a heterologous polypeptide; 
said sequence comprising neither any trp attenuation 
capability nor nucleotides coding for the trp E 
ribosome binding site; 

(b) placing. the transformed inoculant in a fermentation 
vessel and growing the same to a predetermined level 
in suitable nutrient media containing additive 
tryptophan sufficient in quantity to repress said 
promoter-operator system; and 

(c) depriving said bacteria of said additive so as to 
derepress said system and occasion the expression of 
the product for which said structural gene codes. 

10. The vehicle of claim 8 or method of claim 9 wherein the 
polypeptide expressed by said structural gene is entirely 
heterologous. 
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11. The vehicle of claim 8 or the method of claim 9 wherein 
the polypeptide expressed is a fusion protein comprising a 

. heterologous polypeptide and at least a portion of the amino 
acid sequence of a homologous polypeptide. 

12. The vehicle or method of claim 11- wherein said portion is 
a portion of the amino acid sequence of an enzyme involved in 
the biosynthetic pathway from chorismic acid to tryptophan. 

13. The vehicle or method of claim 12 wherein the heterologous 
polypeptide is a bioactive polypeptide and the fused homologous 
polypeptide is a specifically cleavable bioinactivating 
polypeptide. 

14. The vehicle or method of claim 11 wherein the homologous 
polypeptide is the trp E polypeptide and wherein said ribosome 
binding site is the ribosome binding site for the trp leader 
polypeptide. 

15. The vehicle or method of claim 11 wherein the homologous 
polypeptide is the trp D polypeptide. 

16. The vehicle or method of claim 14 wherein the fusion 
protein comprises an heterologous polypeptide and a homologous 
polypeptide which itself constitutes a fusion of about the 
first six amino acids of the trp leader polypeptide and the 
amino acid sequence encoded by at least about the distal 
third of the trp E polypeptide gene. 

^'^l vehicle or claim 8 or method of claim 9 wherein the 

heterologous polypeptide comprises a recoverable polypeptide 
selected froir. the group consisting of human growth hormone, 
hur.an proinsujin, somatostatin, thymosin alpha 1, the A chain 
of hur.^an insulin and the B chain of human insulin. 
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18. The method of claim 8 wherein tryptophan deprivation is 
effected by cessation of addition of said additive and by 
dilution of the fermentation media in which said inoculant is 
first grown up. 

19- The method of claim 18 wherein the host bacteria is 
E*. coll. 



20. The plasmids pBRHtrp, pSOM7A2, pHGH207, pHKYl, pSOM7A2A4, 
pThya7AlA4, and pTha7A2. 
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Description 

Background of the invention 

With the advent of recombinant DNA tech- 
nology, the controlled bacterial production of an 5 
enormous variety of useful polypeptides has 
become possible. Already In hand are bacteria 
modified by this technology to permit the produc- 
tion of such polypeptide products such as 
somatostatin (K. itakura, et al., Science 198. 1056 
[1977]), the (component) A and B chains of human 
insulin (D.V. Goeddel, a/., Proc Nat'l Acad Sci, 
USA 76, 106 [1979]), and human growth hormone 
(D.V. Goeddel, ef a/.. Nature 544 (19791). More 
recently, recombinant DNA techniques have been is 
used to occasion the bacterial production of 
thymosin alpha 1, an immune portentiating sub- 
stance produced by the thymus. Such is the power 
of the technology that virtually any useful polypep- 
tide can be bacterially produced, putting within 20 
reach the controlled manufacture of hormones, 
enzymes, antibodies, and vaccines against a wide 
variety of diseases. The cited materials, which 
describe in greater detail the representative 
examples referred to above, are incorporated 25 
herein by reference, as are other publications 
referred to Infra, to illuminate the background of 
the invention. 

The work horse of recombinant DNA technology 
is the plasmid, a non-chromosomal loop of 30 
double-stranded DNA found in bacteria, often- 
times in multiple copies per bacterial cell. Included 
in the information encoded in the plasmid DNA is 
that required to reproduce the plasmid in daughter 
cells (i.e., a"replicon") and ordinarily, one or more 35 
selection characteristics, such as resistance to 
antibiotics, which permit clones of the host cell 
containing the plasmid of interest to be recoginzed 
and preferentially grown in selective media. The 
utility of bacterial plasmids lies in the fact that they 40 
can be specifically cleaved by one or another 
restriction endonuclease or "restriction enzyme", 
each of which recognizes a different site on the 
plasnrtidic DNA. Thereafter heterologous genes or 
gene fragments may be inserted into the plasmid 45 
by endwise joining at the cleavage site or at 
reconstructed ends adjacent the cleavage site. As 
used herein, the term "heterologous" refers to a 
gene not ordinarily found in, or a polypeptide 
sequence ordinarily not produced by, £ coH, so 
whereas the term "homologous" refers to a gene 
or polypeptide which is produced in wild-type £ 
coiL DNA recombination is performed outside the 
bacteria, but the resulting "recombinant" plasmid 
can be introduced Into bacteria by a process ss 
known as transformation and large quantities of 
the heterologous gene-containing recombinant 
plasmid obtained by growing the transformant. 
Moreover, where the gene is properly inserted 
with reference to portions of the plasmid which so 
govern the transcription and translation of the 
encoded DNA message, the resulting expression 
vehicle can be used to actually produce the 
polypeptide sequence for which the inserted gene 
codes, a process referred to as expression. es 



Expression is initiated in a region known as the 
promoter which is recognized by and bound by 
RNA polymerase. In some cases, as In the trp 
operon discussed infra, promoter regions are 
overlapped by "operator" regions to form a com- 
bined promoter-operator. Operators are DNA 
sequences which are recognized by so-called 
repressor proteins which serve to regulate the 
frequency of transcription initiation at a particular 
promoter. The polymerase travels along the DNA, 
transcribing the information contained in the 
coding strand from its 5' to 3' end into messenger 
RNA which is in turn translated into a polypeptide 
having the amino acid sequence for which the 
DNA codes. Each amino acid is encoded by a 
unique nucleotide triplet or "codon" within what 
may for present purposes be referred to as the 
"structural gene", i.e. that part which encodes the 
amino acid sequence of the expressed product. 
After binding to the promoter, the RNA 
polymerase first transcribes nucleotides encoding 
a ribosome binding site, then a translation initia- 
tion or "start" signal (ordinarily ATG, which in the 
resulting messenger RNA becomes AUG), then the 
nucleotide codons within the structural gene itself. 
So-called stop codons are transcribed at the end of 
the structural gene whereafter the polymerase 
may form an additional sequence of messenger 
RNA which, because of the presence of the stop 
signal, will remain untranslated by the ribosomes. 
Ribosomes bind to the binding site provided on 
the messenger RNA, in bacteria ordinarily as the 
nRNA is being formed, and themselves produce 
the encoded polypeptide, beginning at the transla- 
tion start signal and ending at the previously 
mentioned stop signal. The desired product is 
produced if the sequences encoding the ribosome 
binding site are positioned properly with respect 
to the AUG initiator codon and if all remaining 
codons follow the initiator codon in phase. The 
resulting product may be obtained by lysing the 
host cell and recovering the product by approp- 
riate purification from other bacterial protein. 

Polypeptides expressed through the use of 
recombinant DNA technology may be entirely 
heterologous, as in the case of the direct express- 
ion of human growth hormone, or alternatively 
may comprise a heterologous polypeptide and, 
fused thereto, at least a portion of the amino acid 
sequence of a homologous peptide, as in the case 
of the production of intermediates for somatosta- 
tin and the components of human insulin. In the 
latter cases, for example, the fused homologous 
polypeptide comprised a portion of the amino acid 
sequence for beta galactosidase. In those cases, 
the intended bioactive product is bioinactlvated by 
the fused, homologous polypeptide until the latter 
is cleaved away in an extracellular environment 
Fusion proteins like those just mentioned can be 
designed so as to permit highly specific cleavage 
of the precursor protein from the intended pro- 
duct, as by the action of cyanogen bromide on 
methionine, or alternatively by enzymatic cleav- 
age. See, eg., G.B. Patent Publication No. 
2 007 676 A- 
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The present invention is directed to the creation 
of expression plasmids for the expression of 
heterologous genes in bacteria. The procedure of 
the present invention is illustrated by the construc- 
tion of an expression vehicle designed for direct 
expression of heterologous genes from the trp 
pronnoter-operator, the illustrated procedure 
embodying inventions which are the subject of 
divisional European Applications EP 86548A and 
EP 154133A. 

According to the present invention there is 
provided a method of creating an expression 
plasmid for the expression of a heterologous gene 
which comprises the simultaneous ligation, in 
phase, of: 

(a) a first linear double-stranded DNA fragment 
containing a replicon and a gene which expresses 
a selectable characteristic when placed under the 
direction of a bacterial promoter, said fragment 
lacking any such promoter, said first fragment 
having ligatable ends capableof Hgating to itself or 
to fragment (b) or (c); 

(b) a second linear double-stranded DNA frag- 
ment comprising said heterologous gene, said 
second fragment having ligatable ends capable of 
ligating to itself or to fragment(a) or (c); and 

(c) a third double-stranded DNA fragment which 
comprises a bacterial promoter, said third frag- 
ment having ligatable ends capable of ligating to 
itself or to fragment (a) or ib); 

the ligatable ends of said fragments being con- 
figured so as to be capable of ligating to form a 
replicable plasmid in which both the gene for the 
selectable characteristic and the heterologous 
gene come under the direction of the promoter 
with the heterologous gene lying transcriptionally 
downstream of the promoter and upstream of the 
selectable characteristic gene, the latter being 
incapable of functional ligation to the promoter 
fragment other than via fragment (b) wherein the 
heterologous gene is functionally linked to the 
promoter, thus permitting use of the selectable 
characteristic in selection of transformant bacteria 
colonies capable of expressing the heterologous 
gene. The selectable characteristic is preferably 
antibiotic resistance, for example tetracycline 
resistance. In a preferred embodiment the select- 
able characteristic is tetracycline resistance and 
the bacterial promoter is the trp promoter, ligation 
preferably reconstituting an operon for the 
expression of ampicillin resistance as well. 

The triple ligation of three synthetic DNA frag- 
ments, whose ligatable ends are configured so 
that they can join together only in the desired 
fashion to create a synthetic gene is known in the 
prior art (Goeddel et al. Nature 281 (1979) 
544-548). 

In the accompanying drawings; 

Figures 1 and 2 illustrate in successive stages the 
manner in which an expression plasmid created by 
the method of the invention to form a system in 
which other heterologous genes may be inter- 
changeably expressed as fusions with trp E poly- 
peptide sequences. 

In the figures. Antibiotic resistance-encoding 



genes are denoted Ap"(ampicillin) and Tc" (tet- 
racycline). The legend "Ap*" connotes ampicillin 
sensitivity resulting from deletion of a portion of 
the gene encoding ampicillin sensitivity. Piasmidic 

5 promoters and operators are denoted "p" and "o". 
Finally with regard to conventions, the symbol 
"A" connotes a deletion. Thus, for example, 
reference to a plasmid followed by, say, 
"AEcoRI— Xbal" would describe the plasmid from 

w which the nucleotide sequence between EcoRI and 
Xbal restriction enzyme sites has been removed by 
digestion with those enzymes. For convenience, 
certain deletions are denoted by number. Thus, 
beginning from the first base pair ("bp") of the 

15 EcoRI recognition site which precedes the gene for 
tetracycline resistance in the parental plasmid 
pBR322, "Al" connotes deletion of bp 1—30 (ie, 
AEcoRI — Hind HI) and consequent disenabling of 
the tetracycline promoter-operator system; "A2" 

20 connotes deletion of bp 1—375 (ie, 
AEcoRI— BamHI) and consequent removal of both 
the tetracycline promoter-operator and the struc- 
tural gene which encodes tetracycline resistance; 
and "A3" would connote deletion of bp 

25 3611—4359 (ie, APstI— EcoRi) and elimination of 
ampicillin resistance. "A4"' is used to connete 

removal of bp ^900 1500 from the trp operon 

fragment eliminating the structural gene for the trp 
D polypeptide. 

30 A more detailed description of the Figure 
legends, and of the experimental and theoretical 
background to the work exemplified below, is to be 
found in the divisional applications (Supra). 

35 Example 

Creation of an expression system for trp LE' 
polypeptide fusions wherein tetracycline 
resistance is placed under the control of the 
tryptophan promoter-operator. 

40 The strategy for creation of an expression 
vehicle capable of receiving a wide variety of 
heterologous polypeptide genes for expression as 
trp LE' fusion proteins under the control of the 
tryptophan operon entailed construction of a 

45 plasmid having the following characteristics: 

1 . Tetracycline resistance which would be lost in 
the event of the promoter-operator system con- 
trolling the genes specifying such resistance was 
excised. 

50 2. Removing the promoter-operator system that 
controls tetracycline resistance, and recirculariz- 
ing by ligation to a heterologous gene and a 
tryptophan promoter-operator system in proper 
reading phase with reference thereto, thus restor- 

55 ing tetracyline resistance and accordingly permit- 
ting identification of plasmids containing the 
heterologous gene insert. 

In short, and consistent with the nature of the 
intended inserts, the object was to create a linear 

60 piece of DNA having a Pst residue at its 3' end and 
a Bgl II residue at its 5' end, bounding a gene 
capable of specifying tetracycline resistance when 
brought under the control of a promoter-operator 
system. 

65 Thus, with reference to figure 1, plasmid pBR322 
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was Hind lil digested and the protruding Hind lit 
ends in turn digested with SI nuclease. The SI 
nuclease digestion involved treatment of 10 tig of 
Hind Ill-cleaved pBR322 in 30 ^1 S1 buffer (0.3 M 
NaCI, 1 mM ZnCU, 25 mM sodium acetate, pH 4.5) 
with 300 units SI nuclease for 30 minutes at 15X. 
The reaction was stopped by the addition of 1 [il 
of 30 X SI nuclease stop solution (0.8M tris base, 
50 mM EDTA). The mixture was phenol extracted, 
cholorform extracted and ethanol precipitated, 
then EcoRI digested as previously described and 
the large fragment 46 obtained by PAGE proce- 
dure followed by electroelution. The fragment 
obtained has a first EcoRI sticky end and a second, 
blunt end whose coding strand begins with the 
nucleotide thymidine. As wilt be subsequently 
shown, the 81 -digested Hind 111 residue beginning 
with thymidine can be joined to a Klenow 
polymerase l-treated Bgl II residue so as to recon- 
stitute the Bgl II restrction site upon ligation. 

Plasmid pSorn? A2, as prepared in EP154133A 
was Bgl II digested and the BGl II sticky ends 
resulting made double stranded with the Klenow 
polymerase I procedure using all four deoxynuc- 
leotide triphosphates. EcoRI cleavage of the 
resulting product followed by PAGE and elec- 
troelution of the small fragment 42 yielded a 
linear piece of DNA containing the tryptophan 
promoter-operator and codons of the LE' "proxi- 
mal" sequence upstream from the BGl II site 
("LE'(p)"). The product had an EcoRI end and a 
blunt end resulting from filling in the BGl II site. 
However, the BGl II site is reconstituted by liga- 
tion of the blunt end of fragment 42 to the blunt 
end of fragment 46. Thus, the two fragments were 
ligated in the presence of T^ DNA ligase to form 
the recircularized plasmid pHKY 10 (see Figure 1) 
which was propagated by transformation into 
competent £ coii strain 294 cells. Tetracycline 
resistant cells bearing the recombinant plasmid 
pHKY 10 were grown up, plasmid DNA extracted 
and digested in turn with Bgl II and Pst followed 
by isolation by the PAGE procedure and elec- 
troelution of the large fragment, a linear piece of 
DNA having Pst and Bgl 11 sticky ends. This DNA 
fragment 49 contains the origin of replication and 
subsequently proved useful as a first component 
in the construction of plasmids where both the 
genes coding for trp LE' polypeptide fusion pro- 
teins and the tet resistance gene are controlled by 
the trp promoter/operator. 

Plasmid pSorn? A2A4, as prepared in EP 
154133A, could be manipulated to provide a 
second component for a system capable of 
receiving a wide variety of heterologous struc- 
tural genes. With reference to Figure 2, the pias- 
mld was subjected to partial EcoRI digestion 
followed by Pst digestion and fragment 51 con- 
taining the trp promoter/operator was isolated by 
the PAGE procedure followed by electroelution. 
Partial EcoRI digestion was necessary to obtain a 
fragment which was cleaved adjacent to the 5' 
end of the somatostatin gene but not cleaved at 
the EcoRI site present between the ampicilin 
resistance gene and the trp promoter operator. 



Ampiciltin resistance lost by the Pst I cut in the 
Ap" gene could be restored upon ligation with 
fragment 45. 
In a first demonstration the third component, a 
5 structural gene for thymosin alpha-orw, was 
obtained by EcoRI and BamHI digestion of plas- 
mid pThal (see EP154133A). The fragment, 52. 
was purified by PAGE and electroelution. 
The three gene fragments 49, 51 and 52 could 
w now be ligated together in proper orientation, as 
depicted in Figure 2, to form the plasmid 
pTha7A1A4, which could be selected by reason of 
the restoration of ampicillin and tetracycline 
resistance. The plasmid, when transformed into 
15 E. coli strain 294 and grown up under conditions 
like those described in Part I, expressed a trp LE' 
polypeptide fusion protein from which thymosin 
alpha one could be specifically cleaved by cyano- 
gen bromide treatment. When other heterologous 
20 structural genes having EcoRI and BamHI termini 
were similarly ligated with the pHKYIO-derived 
and pS0M7 A2A4-derived components, trp LE' 
polypeptide fusion proteins containing the poly- 
peptides for which those heterologous genes 
25 code were likewise efficiently obtained. 

Claims 

1. A method of creating an expression plasmid 
30 for the expression of a heterologous gene which 
comprises the simultaneous ligation, in phase, of: 

(a) a first linear double-stranded DNA fragment 
containing a repiicon and a gene which expresses 
a selectable characteristic when placed under the 

35 direction of a bacterial promoter, said fragment 
lacking any such promoter, said first fragment 
having ligatable ends capable of ligating to itself 
or to fragment (b) or (c); 

(b) a second linear double-stranded DNA frag- 
40 ment comprising said heterologous gene, said 

second fragment having ligatable ends capable of 
ligating to itself or to fragment (a) or (c); and 

(c) a third double-stranded DNA fragment 
which comprises a bacterial promoter, said third 

45 fragment having ligatable ends capable of ligat- 
ing to itself or to fragment (a) or (b); 

the ligatable ends of said fragments being 
configured so as to be capable of ligating to form 
a replicable plasmid in which both the gene for 

50 the selectable characteristic and the heterologous 
gene come under the direction of the promoter 
with the heterologous gene lying transcriptionally 
downstream of the promoter and upstream of the 
selectable characteristic gene, the latter being 

55 incapable of functional ligation to the promoter 
fragment other than via fragment (b) wherein the 
heterologous gene Is functionally linked to the 
promoter, thus permitting use of the selectable 
characteristic in selection of transformant bac- 

60 teria colonies capable of expressing the 
heterologous gene. 

2. The method of claim 1 wherein the selectable 
characteristic is antibiotic resistance. 

3. The method of claim 2 wherein the selectable 
55 characteristic is tetracycline resistance and 
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wherein the bacterial promoter is the trp pro- 
moter. 

4. The method of claim 3 wherein ligation 
reconstitutes an operon for the expression of 
ampicillin resistance as well. 

5. A method of any one of the preceding claims, 
wherein the product of ligation is transformed 
into bacterial host, and the bacterial host is 
cultured in a selective medium. 

Patentanspruche 

1. Ein Verfahren zur Erzeugung eines Expres- 
sionsplasmids fur die Expression eines heterolo- 
gen Gens, das die gleichzeige Ligation in Phase 

(a) eines ersten linearen doppelstrangigen 
DNA-Fragments, das ein Replikon und ein Gen 
enthalt, das ein selektierbares Merkmal expri- 
miert, wenn es der Leitung eines bakterielien 
Promotors unterstelit wird, wobei das genannte 
Fragment einen solchen Promotor nicht aufweist, 
wobei das erste Fragment ligierbare Enden auf- 
welste, die fahig sind, an es selbst oder Fragment 
(b) Oder (c) zu iigieren; 

(b) eines zweiten linearen doppelstrangigen 
DNA-Fragments, umfassend das genannte hete- 
rologe Gen, wobei das genannte zweite Fragment 
ligierbare Enden aufweist, die fahig sind, an es 
selbst Oder an Fragment (a) oder (c) zu Iigieren; 
und 

(c) eines dritten doppelstrangigen DNA-Frag- 
ments, das einen bakterielien Promotor enthalt, 
wobei das genannte dritte Fragment ligierbare 
Enden aufweist, die fahig sind, es selbst oder 
Fragment (a) oder (b) zu Iigieren; wobei die 
ligierbaren Enden der genannten Fragmente so 
ausgebildet sind, daS sie fahig sind, zu Iigieren, 
um ein replizierbares Plasmid zu bilden, in dem 
sowohl das Gen fur das selektierbare Merkmal als 
auch das heterotoge Gen der Leitung des Promo- 
tors unterworfen werden, wobei das heterologe 
Gen transskriptionell stromabwarts von Promotor 
und stromaufwarts vom Gen fur das selektierbare 
Merkmal geiegen ist, wobei letzteres Gen zur 
funktionellen Ligation an das Promotorfragment 
nur uber das Fragment (b) fahig ist, worin das 
heterologe Gen funktionel! an den Promotor 
gekoppelt ist, wodurch die Verwendung des 
selektierbaren Merkmals bei der Selektion von 
transformanten Bakterienkolonien moglich ist, 
die fahig sind, das heterologe Gen zu exprimie- 
ren. 

2. Das Verfahren nach Anspruch 1, worin das 
selektierbare Merkmal Antibiotikaresistenz ist 

3. Das Verfahren nach Anspruch 2, worin das 
selektierbare Merkmal Tetracyclinresfstenz und 
der bakterielle Promotor der trp-Promotor ist, 

4. Das Verfahrn nach Anspruch 3, worin die 
Ligation aulSerdem ein Operon fur die Expression 
von Ampicillinresistenz rekonstituiert. 

5. Ein Verfahren nach einem der vorhergehen- 
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den Anspruche, worin das Produkt der Ligation in 
einen bakterielien Wirt transformiert und der bak- 
terielle Wirt einem selektiven Medium kultiviert 
wird. 

5 

Revendicattons 

1. Methode pour la creation d'un plasmide 
d'expression pour Texpression d'un gene hetero- 
w logue qui comprend la ligature simultanee, en 
phase, de: 

(a) un premier fragment d'AdN lineaire a deux 
brins contenant un replicon et un gene qui 
exprime une caracteristique pouvant etre selec- 

15 tionnee lorsqu'il est place sous la direction d'un 
promoteur bacterien, ledit fragment manquant de 
ce promoteur, ledit premier fragment ayant des 
extremites pouvant §tre ligaturees, capables de 
se tigaturer ^ elles-memes ou au fragment (b) ou 

20 (c); 

(b) un second fragment d'ADN lineaire a deux 
brins comprenant ledit gene heterologue, ledit 
second fragment ayant des extremites pouvant 
etre ligaturees, capables de se ligaturer a elles- 

25 m§mes ou au fragment (a) ou (c); 

(c) un troisieme fragment d'ADN h dux brins qui 
comprend un promoteur bacterien, ledit troi- 
sidme fragment ayant des extremites pouvant 
etre ligaturees, capables de se ligaturer ^ elles- 

30 memes ou au fragment (a) ou (b); 

ies extremites desdits fragments pouvant etre 
ligaturees etant configurees afin d'dtre capables 
de se ligaturer pour former un plasmide replica- 
ble ou h la fois le gdne pour la caracteristique 

35 pouvant etre selectionnee et le g^ne h6t6rologue 
viennent sous la direction du promoteur avec ie 
gene heterologue se trouvant, par transcription, 
en aval du promoteur et en amont du g^ne de la 
caracteristique pouvant etre selectionnee, ce der- 

40 nier etant incapable d'une ligature fonctionnelle 
au fragment promoteur autre que via I fragment 
(b) oCi le gene heterologue est fonctionnellement 
lie au promoteur, permettant ainsi rutiiisation de 
la caracteristique pouvant etre selectionnee, pour 

45 la selection de colonies de bacteries transfor- 
mantes capables d'exprimer le g§ne heterologue. 

2. M§thode selon la revendication 1 oCi la 
caracteristique pouvant etre selectionnee est la 
resistance aux antibiotiques. 

50 3. Methode selon la revendication 2 ou la 
caracteristique pouvant etre selectionnee est la 
resistance h la tetracycline et ou le promoteur 
bacterien est le promoteur trp, 

4. Methode selon la revendication 3 ou la 
55 ligature reconstitue un operon pour I'expression 

de ta resistance d Tampicilline egalement 

5, Methode selon Tune quelconque des reven- 
dications precedentes ou le produit de la ligature 
est transforme dans un hdte bacterien et I'hote 

60 bacterien est mis en culture dans un milieu 
seiectif. 
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